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CROSS-REFERENCE TO RELATED APPLICATIONS 

Priority is claimed under 35 U.S.C. 1 19(e) to U.S. provisional application Serial 
No. 60/410,663 filed on September 12, 2002, the entire contents of which is incorporated herein 
by reference, in its entirety. 

FIELD OF THE INVENTION 

The present invention relates to methods for the design, analysis and/or evaluation 

of nucleic acid molecules, particular oligonucleotide nucleic acids (also referred to as 
"oligomers"). The invention also relates to the design, analysis and/or evaluation of nucleic acids 
for particular uses or applications. For example, in particular embodiments the invention relates 
to methods for designing oligonucleotide probes and primers, e.g., for use in PCR or on 
microarrays. The invention still further relates to systems, including computer systems and 
computer program products which may be used to practice the particular methods of this 
invention and/or to program a computer to implement such methods. 



BACKGROUND OF THE INVENTION 

Hybridization between complementary nucleic acids is an implicit feature in the 

Watson-Crick model for DNA structure that is exploited for many applications of the biological 
and biomedical arts. For example, virtually all methods for replicating and/or amplifying nucleic 
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acid molecules are initiated by a step in which a complementary oligonucleotide (typically 
referred to as a "primer") hybridizes to some portion of a "target" nucleic acid molecule. A 
polymerase then synthesizes a complementary nucleic acid from the primer, using the target 
nucleic acid as a "template". See, Kleppe et ai,J. Mol. Biol 1971, 56:341-361. 
5 One particular application, known as the polymerase chain reaction (PCR), is 

widely used in a variety of biological and medical arts. For a description, see Saiki et ah, 
Science 1985, 230:1350-1354. In PCR, two or more primers are used that hybridize to separate 
regions of a target nucleic acid and its complementary sequence. The sample is then subjected to 
multiple cycles of heating and cooling, repeatedly hybridizing and dissociating the 

10 complementary strands so that multiple replications of the target nucleic acid and its complement 
are performed. As a result, even very small initial quantities of a target nucleic acid may be 
enormously increased or "amplified" for subsequent uses {e.g., for detection, sequencing, etc.). 

Multiplex PCR is a particular version of PCR in which several different primers 
are used to amplify and detect a plurality of different nucleic acids in a sample - usually ten to a 

1 5 hundred different target nucleic acids. Thus, the technique allows a user to simultaneously 

amplify and evaluate large numbers of different nucleic acids simultaneously in a single sample. 
The enormous benefits of high throughput, speed and efficiency offered by this technique has 
made multiplex PCR increasingly popular. However, achievement of successful multiplex PCR 
usually involves empirical testing as existing computer programs that pick and/or design PCR 

20 primers have errors. In multiplex PCR, the errors become additive and therefore good results are 
seldom achieved without some amount of trial and error. Markouatos et al.,J. Clin. Lab Anal. 
2002, 16(1):47-51; Henegarin et ai, Biotechniques 1997, 23(3):504-l 1. 
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Other techniques that are widely used in the biological and medical arts exploit 
nucleic acid hybridization to detect target nucleic acid sequences in a sample. See, for example, 
Southern, J. Mol. Biol. 1975, 98:503-517; Denhardt, Biochem. Biophys. Res. Commun. 1966, 
23:641-646; Meinhoth & Wahl, Anal Biochem. 1984,138:267-284. For instance, Southern 
5 blotting and similar techniques have long been used in which nucleic acid molecules from a 
sample are immobilized onto a solid surface or support (e.g., a membrane support). A target 
nucleic acid molecule of interest may then be detected by contacting one or more complementary 
nucleic acids (often referred to as a nucleic acid "probes") and detecting their hybridization to 
nucleic acid molecules on the surface or support (for example, through a signal generated by 

10 some detectable label on the probes). 

Similar techniques are also known in which one or more nucleic acid probes are 
immobilized onto a solid surface or support, and a sample of nucleic acid molecules is 
hybridized thereto. Nucleic acid arrays, for example, are known and have become increasingly 
popular in the art. See, e.g., DeRisi et al, Science 1997, 278:680-686; Schena et al, Science 

15 1995, 270:467-470; and Lockhart et al, Nature Biotech. 1996, 14:1675. See also, U.S. Patent 
No. 5,510,270 issued April 23, 1996 to Fodor et al Nucleic acid arrays typically comprise a 
plurality (often many hundreds or even thousands) of different probes, each immobilized at a 
defined location on the surface or support. A sample of nucleic acids (for example, an mRNA 
sample, or a sample of cDNA or cRNA derived therefrom) that are preferably detectably labeled 

20 may then be contacted to the array, and hybridization of those nucleic acids to the different 

probes maybe assessed, e.g., by detecting labeled nucleic acids at each probe's location on the 
array. Thus, hybridization techniques using nucleic acid arrays have the potential for 
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simultaneously detecting a large number of different nucleic acid molecules in a sample, by 
simultaneously detecting their hybridization to the different probes of the array. 

The successful implementation of all techniques involving nucleic acid 
hybridization (including the exemplary techniques described, supra) is dependent upon the use 
5 of nucleic acid probes and primers that specifically hybridize with complementary nucleic acids 
of interest while, at the same time, avoiding non-specific hybridization with other nucleic acid 
molecules that may be present. For a review, see Wetmur, Critical Reviews in Biochemistry and 
Molecular Biology 1991, 26:227-259. These properties are even more critical in techniques, 
such as multiplex PCR and microarray hybridization, where a plurality of different probes or 

10 primers is used, each of which is preferably specific for a different target nucleic acid. 

Duplex stability between complementary nucleic acid molecules is frequently 
expressed by the duplex's "melting temperature" (T m ). Roughly speaking, the T m indicates the 
temperature at which a duplex nucleic acid dissociates into single-stranded nucleic acids. 
Preferably, nucleic acid hybridization is performed at a temperature slightly below the T m so that 

15 hybridization between a probe or primer and its target nucleic acid is optimized, while 

minimizing non-specific hybridization of the probe or primer to other, non-target nucleic acids. 
Duplex stability and T m are also important in applications, such as PCR, where thermocycling 
may be involved. During such thermocycling steps, it is important that the sample temperature 
be raised sufficiently above the T m so that duplexes of the target nucleic acid and its complement 

20 are dissociated. In subsequent steps of reannealing, however, the temperature must be brought 
sufficiently below the T m that duplexes of the target nucleic acid and primer are able to form, 
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while still remaining high enough to avoid non-specific hybridization events. For a general 
discussion, see Rychlik et ai, Nucleic Acids Research 1990, 18:6409-6412. 

Traditionally, theoretical or empirical models that relate duplex stability to 
nucleotide sequence have been used to predict or estimate melting temperatures for particular 
5 nucleic acids. For example, Breslauer et ai (Proa Natl. Acad ScL U.S.A. 1986, 83:3746-3750) 
describe a model for predicting melting temperatures that is widely used in the art, known as the 
"nearest neighbor model". See also, SantaLucia et ah, Biochemistry 1996, 35:3555-3562; and 
SantaLucia, Proc. Natl Acad. ScL U.S.A. 1998, 95:1460-1465. Such models are usually 
calibrated or optimized for particular salt conditions, typically 1 M Na + . However, applications 

10 that exploit nucleic acid hybridization may be implemented in a variety of different salt 

conditions, with cation concentrations typically being on the order of magnitude of 10-100 mM. 
Thus, melting temperatures for particular probes or primers in an assay are typically predicted by 
predicting a melting temperature at a first salt concentration using the nearest neighbor or other 
model, and then using another theoretical or empirical model to predict what effect(s) the salt 

1 5 conditions of the particular assay will have on that melting temperature. 

Most, if not all of the existing models used to estimate T m treat the effects of salt 
concentration as being separate from and independent of the nucleotide sequence. For example, 
Schildkraut et al {Biopolymers 1965, 3:195-208) proposed the following formula to estimate 
nucleic acid melting temperatures at different sodium ion concentrations, [Na + ]: 

20 

^([Na + ]) = T m ° + 16.6 x log[Na + ] (Equation 1.1) 
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where T m is the melting temperature of the DNA duplex in 1 M sodium ions. Equation 1.1, 
above, is based on empirical data from the specific study of Escherichia coli genomic DNA in 
buffer of between 0.01-0.2 M Na + . Nevertheless, the use of this equation has been routinely 
generalized to model any DNA duplex oligomer pair. See, for example, Rychlik et ai, Nucleic 
5 Acids Res. 1990, 18:6409-6412, Ivanov & AbouHaidar, Analytical Biochemistry 1995, 232:249- 
251; Wetmur, Critical Review in Biochemistry and Molecular Biology 1991, 26:227-259. 

There is evidence, however, indicating that the effects of salt concentration on the 
melting temperature of nucleotide duplexes are not sequence independent but, rather, depend 
substantially on sequence composition of the particular nucleic acids. For a review see, 
10 Bloomfield et a/., Nucleic Acids: Structure, Properties, and Functions (University Science 
Books, Sausalito California 2000): pages 307-308. For example, Owen et al. 
(Biopolymers 1969, 7:503-516) have proposed one empirical formula, based on melting 
experiments of bacterial DNA, that relates melting temperature (T m ) of long polymeric DNAs to 
log[Na + ] and the nucleic acid's G-C content,/(G-C): 

15 

y(G-C) = tan(70.077 + 3.32 x log[Na + ]) x (T m - 175.95) + 260.34 (Equation 1.2) 

Still others (Frank-Kamenetskii, Biopolymers 1971, 10:2623-2624) have reanalyzed the same 
experimental data and suggested simplified equations, purportedly reflecting the linear 
20 dependence of melting temperature on log[Na + ] : 

T m = 1 76.0 - (2.60 -/(G-C) ) x (36.0 - 7.04 x log[Na + ] ) (Equation 1 .3) 
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Doktycz et al. (Biopolymers 1992, 32:849-864) have applied Equation 1.3, above, 
to estimate the salt dependence of T m for average G-C and A-T base pairs in a DNA duplex, and 
concludes that the dependence is governed by different equations for each type of base pair. 
Blake & Delcourt (NucL Acids Res. 1998, 26:3323-3332; Corrigendum, NucL Acids Res. 1999, 
5 27, No. 3) also report that the rate at which T m changes as a linear function of log[Na + ] varies 
with each nearest neighbor, based on melting curves of synthetic tandemly repeating nucleic acid 
inserts in recombinant pN/MCS plasmids. However, their experiments were conducted in the 
narrow range of Na + concentrations from 34 mM to 1 14 mM. 

Rouzina & Bloomfield {Biophysical Journal 1999, 77:3242-3251) have also 

10 analyzed melting data from large, polymeric DNA molecules and propose an alternative 

interpretation for the salt dependence of melting temperatures. In particular, the publication 
suggests a new explanation of empirical Frank - Kamenetskii's relationship (Equation 1 .3) that 
salt dependence of T m may be due to small differences between the heat capacities of duplex and 
single-stranded nucleic acid molecules in solution. The publication suggests that this effect may 

15 be at least partially sequence dependent. Yet, no new relationship between nucleotide sequence 
and the effect is proposed or suggested. 

Finally, Owczarzy et al, Biopolymers 1997, 44:217-239 describe experiments 
evaluating melting temperatures for oligonucleotide duplexes with various G-C content, /(G-C). 
However, melting temperatures were evaluated at only two concentrations of sodium ions, 1 M 

20 and 1 1 5 mM. Consequently, the publication provides an equation that only predicts T m values 
between those two conditions. 
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Despite the existence of such data, sequence-independent formulas such as 
Equations 1 .1, supra, are still used in the art to estimate salt-corrected melting temperatures. For 
instance, as recently as 1998 SantaLucia et al (Proc. Natl. Acad. Set U.S.A. 1998, 95:1460- 
1465) have advocated formulas that estimate salt dependence of a melting temperature by 
5 assuming the effects are sequence independent. Thus, even though there may be data suggesting 
that the effects of salt on a nucleic acid's melting temperature depend on the nucleotide sequence, 
the available data is incomplete and, in many instances, obtained under conditions which are, at 
best, remote from those of biological or biomedical techniques that involve nucleic acid 
hybridization. Specifically, effects of sodium ions on T m have been systematically studied only 

10 for long DNA polymers and DNA dumbbells. See Blake & Delcourt, Nucl Acids Res. 1998, 
26:3323-3332 and Doktycz et al (Biopolymers 192, 32:849-864). As a result, the exact effect 
salt conditions will have on a probe or primer's melting temperature in such assays remains 
poorly characterized and unknown. Consequently, currently available methods for estimating 
melting temperatures of particular probe or primer sequences in hybridization assays are 

1 5 inaccurate and unreliable. 

Yet, given the prevalence and importance of such assays in the biological and 
biomedical arts, there is a significant need for methods of estimating and predicting melting 
temperatures with improved accuracy. In particular, there is a need for methods which predict or 
estimate the melting temperature for a nucleic acid, particularly for an oligonucleotide {e.g., an 

20 oligonucleotide probe or primer) in a PCR or other assay that involve nucleic acid hybridization. 
There exists, moreover, a need for reliable and accurate methods that estimate effects of 
changing salt concentration on the melting temperature of particular nucleic acid sequences. 
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There further exists a need for methods of designing oligonucleotides, e.g., as probes or primers 

for a particular hybridization, PCR or other method, in which the melting temperature of each 

oligonucleotide is optimized for the particular method or assay. 

***** 

5 The citation or discussion of any reference in this section or elsewhere in the 

specification is made only to clarify the description of the present invention and is not an 
admission that any such reference is "prior art" against any invention described herein. 

SUMMARY OF THE INVENTION 

10 Applicants have discovered a method for estimating a melting temperature for a 

polynucleotide and its complementary sequence in a particular salt condition. The method is 
directed to obtaining a reference melting temperature at a reference salt concentration and then 
calculating, from the reference temperature in a manner that is dependent on the G-C content of 
the polynucleotide, a new salt dependent "corrected" melting temperature. The invention 

1 5 provides a novel method for reliably estimating melting temperatures for polynucleotides at a 
desired salt concentration. The method is straightforward and is computationally tractable. 

Accordingly, a skilled artisan can readily use the method to estimate 
polynucleotide melting temperatures under particular salt conditions and/ or adjust salt 
conditions for an assay. In addition, a skilled artisan can readily use the method to estimate 

20 melting temperatures for a variety of different polynucleotide probes and/or primers in desired 
salt conditions and those probes and/or primers having optimal melting temperatures may then 
be selected. 
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The method therefore provides for obtaining a reference melting temperature for a 
particular nucleic acid at a reference salt concentration. Accordingly, a skilled artisan can 
readily use theoretical, empirical or semi-empirical methods to obtain an accurate or reliable 
reference melting temperature. The method also provides for a desired salt concentration. A 
5 skilled artisan will readily obtain a desired salt concentration based upon the polynucleotide 

melting conditions of interest to the artisan. More specifically, the method provides for using the 
reference polynucleotide melting temperature to estimate or determine a salt-dependent, 
"corrected" melting temperature in a manner that is dependent on the G-C content of the 
polynucleotide. It is further provided that the G-C content can be used by a skilled artisan to 

10 determine a G-C content value. 

The method of the present invention provides for the use of formulas which may 
be used to implement the method of the invention. The formulas elucidate the relationship of the 
reference melting temperature, the desired salt concentration and the G-C content value in the 
estimation of the salt "corrected" melting temperature. Accordingly a skilled artisan may readily 

1 5 estimate the desired melting temperature of a polynucleotide using the method of the present 
invention. In addition, optimized coefficients derived from experimentally measured data are 
provided for use with the formulations. 

Computer systems are also provided that may be used to implement the analytical 
methods of the invention, including methods of estimating a salt-corrected melting temperature 

20 of a polynucleotide. These computer systems comprise a processor interconnected with a 

memory that contains one or more software components. In particular, the one or more software 
components include programs that cause the processor to implement steps of the analytical 
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methods described herein. The software components may comprise additional programs and/or 
files including, for example, sequence or structural databases of polymers. 

Computer program products are further provided, which comprise a computer 
readable medium, such as one or more floppy disks, compact discs (e.g., CD-ROMS or RW- 
5 CDS), DVDs, data tapes, etc., that have one or more software components encoded thereon in 
computer readable form. In particular, the software components may be loaded into the memory 
of a computer system and may then cause a processor of the computer system to execute steps of 
the analytical methods described herein. The software components may include additional 
programs and/or files including databases, e.g., of polymer sequences and/or structures. 

10 

BRIEF DESCRIPTION OF DRAWINGS 
FIG. 1 A shows a UV-melting curve for a 2 (iM solution of the oligonucleotide 

5 '-TAACC ATACTGAATACCTTTTGACG-3 ' (SEQ ID NO: 45) and its complement dissolved 

in 68.9 mM Na + buffer. 

15 

FIG. IB shows a differential scanning calorimetry (DSC) curve for a 90 |uM 
solution of the oligonucleotide 5 '-TAACCATACTGAATACCTTTTGACG-3 ' (SEQ ID NO:45) 
and its complement dissolved in 68.9 mM Na + buffer. 

20 FIG. 2 shows an exemplary computer system that may be used to implement the 

analytical methods of the invention. 
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FIG. 3 shows the slope obtained for each of a plurality of different oligomers 
(SEQ ID NOS:1-80) whose experimentally determined melting temperatures (r„/s) are fit to a 
linear function of log[Na + ], plotted as a function of each oligomer's G-C content J{G-C). Melting 
temperatures for each oligomer were measured in estimated sodium cation concentrations of 
5 68.9, 220, 621 and 1020 mM and are set forth in Table I, infra. 



FIG. 4 shows the slopes, as a function of G-C content /(G-C), obtained where the 
inverse of experimentally determined melting temperatures {i.e. \/T m ) for each of the plurality of 
different oligomers (SEQ ID NOS:1-80) are fit to a linear function of log[Na + ]. 

10 

DETAILED DESCRIPTION OF THE INVENTION 



Definitions : 

The term "melting profile" refers to a collection of measurements of an 
1 5 oligonucleotide and its complement which indicate the oligonucleotide molecule's transition 

from double-stranded to single-stranded nucleic acid (or vice-versa). The transition of a nucleic 
acid from double-stranded to single- stranded is often described in the art as the "melting" of that 
nucleic acid molecule. The transition may also be described as the "denaturation" or 
"dissociation" of the nucleic acid. Accordingly, a melting profile of the present invention may 
20 also be referred to as a "dissociation profile", a "denaturation profile", a "melting curve", a 
"dissociation curve", etc. 
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The term "salt concentration" is interchangeably used with the term "ion 
concentration" and refers, specifically, to the concentration of cations {i.e., positively charged 
ions within a sample). Types of ions include, but are not limited to, lithium, potassium, sodium, 
rubidium, cesium and francium. Ions may carry a single or multiple charge. A preferred 
5 embodiment of the invention is the use of monovalent ions. It is preferred that the ion 

concentration ranges from about 1 mM to about 5 M. It is more preferred, however, that the ion 
concentration range from about 5 mM to about 2 M. In particularly preferred embodiments, the 
ion concentration ranges from about 70 mM to about 1020 mM. 

The "melting temperature" or "T m " of a nucleic acid molecule generally refers to 

1 0 the temperature at which a polynucleotide dissociates from its complementary sequence. 

Generally, the T m may be defined as the temperature at which one-half of the Watson-Crick base 
pairs in duplex nucleic acid molecules are broken or dissociated {i.e., are "melted") while the 
other half of the Watson-Crick base pairs remain intact in a double stranded conformation {i.e., 
the fraction of broken based pairs, 0{T) = 0.5 when T = T m ). In preferred embodiments where 

1 5 duplex nucleic acid molecules are oligonucleotides and in other embodiments where the duplex 
nucleic acids dissociate in a two-state fashion, the T m of a nucleic acid may also be defined as the 
temperature at which one-half of the nucleic acid molecules in a sample are in a single-stranded 
conformation while the other half of the nucleic acid molecules in that sample are in a double- 
stranded conformation. T m , therefore defines a midpoint in the transition from double-stranded 

20 to single-stranded nucleic acid molecules (or, conversely, in the transition from single- stranded 
to double-stranded nucleic acid molecules). 
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It is well appreciated in the art that the transition from double-stranded to single- 
stranded nucleic acid molecules does not occur at a single temperature but, rather, over a range 
of temperatures. Nevertheless, the T m provides a convenient measurement for approximating 
whether nucleic acid molecules in a sample exist in a single- stranded or double-stranded 
5 conformation. As such, the melting temperature of a nucleic acid sample may be readily 
obtained by simply evaluating a melting profile for that sample. 

The methods and algorithms of this invention involve calculating estimated 
melting temperatures for complementary nucleic acids and can be applied generally to any of the 
various types of nucleic acids, including but not limited to DNA, RNA, mRNA, cDNA, and 

10 cRNA, as well as "protein nucleic acids" (PNA) formed by conjugating bases to an amino acid 
backbone. Polynucleotides that may be used in accordance with the present invention also 
include double stranded DNA and RNA duplex oligomers, single stranded DNA and RNA, as 
well as backbone modifications thereof (for example, methylphosphonate linkages). This also 
includes nucleic acids containing modified bases, for example, thio-uracil, thio-guanine and 

15 fluoro-uracil. 

The nucleic acids may also be modified by many means known in the art. Non- 
limiting examples of such modifications include methylation, "caps", substitution of one or more 
of the naturally occurring nucleotides with an analog, and internucleotide modifications such as, 
for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, 
20 phosphoroamidates, carbamates, etc.) and with charged linkages {e.g., phosphorothioates, 
phosphorodithioates, etc.). Polynucleotides may contain one or more additional covalently 
linked moieties, such as proteins {e.g., nucleases, toxins, antibodies, signal peptides, poly-L- 
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lysine, etc.), intercalators (e.g., acridine, psoralen, eta), chelators (e.g., metals, radioactive 
metals, iron, oxidative metals, etc.) and alkylators to name a few. The polynucleotides may be 
derivatized by formation of a methyl or ethyl phosphotri ester or an alkyl phosphoramidite 
linkage. Furthermore, the polynucleotides herein may also be modified with a label capable of 
5 providing a detectable signal, either directly or indirectly. Exemplary labels include 
radioisotopes, fluorescent molecules, biotin and the like. 

As used herein, the terms "polynucleotide" and "oligonucleotide" are 
interchangeable and are generally used to describe nucleic acid polymers typically having no 
more than about 500 base pairs. In preferred embodiments, the present invention is practiced 

10 using oligonucleotides between about 5 and 150 nucleotides in length, and more preferably 
between about 10 and 30 nucleotides in length. Oligonucleotides used in the present invention 
may hybridize to any type of nucleic acid from any source; including but not limited to genomic 
DNA, mRNA, cDNA, Expressed Sequence Tags (ESTs), and chemically synthesized nucleic 
acids. Oligonucleotides of the invention may also hybridize to other oligonucleotide molecules. 

15 Oligonucleotides and other polynucleotides can be labeled, e.g., with 32 P- 

nucleotides or nucleotides to which a label, such as biotin or a fluorescent dye (for example, Cy3 
or Cy5) has been covalently conjugated. Generally, oligonucleotides are prepared synthetically, 
preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be prepared with 
non-naturally occurring phosphoester analog bonds, such as thioester bonds, etc. 

20 A pair of hybridized polynucleotides may be complementary along their entire 

length or, alternatively, along only a part of their sequence. In preferred embodiments, all of the 
nucleotides in a pair of hybridized oligonucleotides are complementary. However, mismatch 
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base pairing between complementary nucleic acids may occur, and such nucleic acids are 
therefore said to be less than 100% complementary. In particular, the extent of complementarity 
is usually indicated by the fraction ( e.g. , the percentage) of mismatched base pairs out of the total 
number of base pairs in the complementary polynucleotides. It is very preferred that there is at 
5 least 99% complementarity between the polynucleotide and its complementary sequence. 

However, less complementarity may be acceptable or even desirable in some embodiments. For 
example, in some embodiments, the level of complementary may be as low as 95%, 85% or 
75%. 

In preferred embodiments, the terms "about" and "approximately" shall generally 
10 mean an acceptable degree of error for the quantity measured given the nature or precision of the 
measurements. Typical, exemplary degrees of error are within 20 percent (%), preferably within 
10%, and more preferably within 5% of a given value or range of values. Alternatively, and 
particularly in biological systems, the terms "about" and "approximately" may mean values that 
are within an order of magnitude, preferably within 5-fold and more preferably within 2-fold of a 
1 5 given value. Numerical quantities given herein are approximate unless stated otherwise, 

meaning that the term "about" or "approximately" can be inferred when not expressly stated. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such 
as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule 
can anneal to the other nucleic acid molecule under the appropriate conditions of temperature 
20 and solution ionic strength (see Sambrook et al, supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. 
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Hybridization requires that the two nucleic acids contain complementary 
sequences. However, mismatches between bases are possible depending on the stringency of the 
hybridization conditions. The appropriate stringency for hybridizing nucleic acids depends on 
the length of the nucleic acids and the degree of complementarity, variables well known in the 
5 art. The greater the degree of similarity or homology between two nucleotide sequences, the 
greater the value of T m for duplexes of nucleic acids having those sequences. For duplexes of 
greater than 100 nucleotides in length, equations for calculating T m have been derived (see 
Sambrook et ah, supra, 9.50-9.51). For hybridization with shorter nucleic acids, i.e., 
oligonucleotides, the position of mismatches becomes more important, and the length of the 

10 oligonucleotide determines its specificity (see Sambrook et al, supra, 1 1 .7-1 1 .8). A minimum 
length for a hybridizable nucleic acid is at least about 10 nucleotides; preferably at least about 15 
nucleotides; and more preferably the length is at least about 20 nucleotides. Unless other 
conditions are specified, the term "standard hybridization conditions" refers to a T m of about 
55°C, and utilizes conditions as set forth above. In a preferred embodiment, the T m is 60°C; in a 

15 more preferred embodiment, the T m is 65°C. In a specific embodiment, "high stringency" refers 
to hybridization and/or washing conditions at 68°C in 0.2x SSC, at 42°C in 50% formamide, 4x 
SSC, or under conditions that afford levels of hybridization equivalent to those observed under 
either of these two conditions. 

Suitable hybridization conditions for oligonucleotides {e.g., for oligonucleotide 

20 probes or primers) are typically somewhat different than for full-length nucleic acids (e.g., full- 
length cDNA), because of the oligonucleotides' lower melting temperature. Because the melting 
temperature of oligonucleotides will depend on the length of the oligonucleotide sequences 
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involved, suitable hybridization temperatures will vary depending upon the oligonucleotide 
molecules used. Exemplary temperatures may be 37°C (for 14-base oligonucleotides), 48°C (for 
17-base oligonucleotides), 55°C (for 20-base oligonucleotides) and 60°C (for 23-base 
oligonucleotides). Exemplary suitable hybridization conditions for oligonucleotides include 
5 washing in 6x SSC/0.05% sodium pyrophosphate, or other conditions that afford equivalent 
levels of hybridization. 

Nucleic acids can be purified by precipitation, chromatography (including 
preparative solid phase chromatography), oligonucleotide hybridization, ultracentrifugation, and 
other means. In one method, nucleic acids are purified using polyacrylamide gel purification 
1 0 (PAGE) techniques. In another preferred embodiment, they are purified using high pressure 
liquid chromatography (HPLC). Such methods of purification are also well known in the art. 

General Methods : 

In accordance with the invention, there may be employed conventional molecular 
15 biology, microbiology and recombinant DNA techniques within the skill of the art. Such 

techniques are explained fully in the literature. See, for example, Sambrook, Fitsch & Maniatis, 
Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (referred to herein as "Sambrook et aL, 
1989"); DNA Cloning: A Practical Approach, Volumes I and II (D.N. Glover ed. 1985); 
20 Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S J. 
Higgins, eds. 1984); Animal Cell Culture (R.I. Freshney, ed. 1986); Immobilized Cells and 
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Enzymes (IRL Press, 1986); B.E. Perbal, A Practical Guide to Molecular Cloning (1984); RM. 
Ausubel et al (eds.), Current Protocols in Molecular Biology?, John Wiley & Sons, Inc. (1994). 

This invention pertains to a method for predicting the melting temperature at a 
specific ion concentration for a polynucleotide having a G-C content. This invention can be 
5 applied to the design of oligonucleotide probes, hybridization and PCR methods, and microarray 
hybridization methods. 

Overview of the Method : 

In accordance with the present invention, methods are provided here for 
10 estimating a melting temperature (T m ) for a polynucleotide or, more specifically, for a 

polynucleotide and its complementary sequence. Such methods are particularly well suited for 
the design of oligonucleotide probes and primers, e.g., for use in biological assays such as PCR 
and nucleic acid hybridization assays. The methods of the invention are robust and 
straightforward, and provide reliable predictions or estimations of melting temperatures for 
1 5 polynucleotides under conditions that are typically used in such assays. In particular, using the 
methods of the invention a skilled artisan may readily determine or estimate melting 
temperatures for polynucleotides under particular salt conditions and/or may adjust salt 
conditions for an assay accordingly. Alternatively, the methods of the invention may be used to 
determine or estimate melting temperatures for a variety of different polynucleotide probes 
20 and/or primers in desired salt conditions, and those probes and/or primers having optimal melting 
temperatures for the assay may then be selected. 
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In its simplest form, the method of the invention comprises a step of obtaining or 
determining a "reference" melting temperature for a polynucleotide in a particular salt 
concentration (i.e., the "reference" salt concentration). The reference temperature may then be 
used in accordance with the present invention to obtain or estimate a "salt-corrected" melting 
5 temperature therefrom. More specifically, the salt-corrected melting temperature may be derived 
from the reference melting temperature according to a relationship that is dependent upon the 
polynucleotide's G-C sequence content. A more detailed description of these methods follows 
below. 

1 0 Reference melting temperature. A reference melting temperature, typically 

denoted here by the symbol T m °, may be readily obtained for a particular nucleic acid using any 
technique known in the art for obtaining or determining melting temperatures. For example, 
melting temperatures may be experimentally determined for one or more polynucleotides (as 
described in the Examples, infra) at some standard or reference salt concentration, and these 

1 5 empirically determined melting temperatures may then be used as reference melting 
temperatures in accordance with the present invention. However, a reference melting 
temperature may also be obtained or provided using theoretical, empirical or semi-empirical 
models that predict melting temperatures at some salt concentration. In particularly preferred 
embodiments, the reference melting temperature for a polynucleotide is obtained using the 

20 "nearest neighbor model", which is well known in the art (see, e.g., Breslauer et ai, Proc. Natl 
Acad. Sci. U.S.A. 1986, 83:3746-3750; and SantaLucia et ai, Proc. Natl. Acad. Sci. U.S.A. 1998, 
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95:1460). However, various other models are known in the art and may also be used in 
accordance with the present invention. 

The exact experimental method, model or formula used to obtain the reference 
melting temperature is not crucial for practicing the invention. For example and as noted above, 
5 the reference melting temperature may be determined empirically; e.g., by using the melting 
temperature of a polynucleotide duplex at some reference salt concentration. However, the 
melting temperature may also be calculated using some theoretical, empirical or semi-empirical 
model. The model will preferably provide an 'accurate' or reliable estimate of the melting 
temperature at some salt concentration for which the model has been optimized. For example, 

10 the nearest neighbor model and many other models for predicting melting temperatures use 

parameters that have been particularly optimized for a 1 M concentration of monovalent cations 
(specifically, for 1 M Na + ). Accordingly, in embodiments where such models are used to obtain 
a reference melting temperature, the reference salt concentration is preferably 1 M. Generally, 
those skilled in the art will readily appreciate for what salt concentrations a method or model for 

1 5 obtaining melting temperatures has been optimized and, accordingly, will be able to use those 
salt concentrations as the "reference" salt concentration (denoted herein by the symbol [X + ]o) for 
practicing the methods of this invention. 

Salt concentration. In accordance with the methods of this invention, the melting 
20 temperature of a polynucleotide may be readily determined for a particular salt concentration 
(denoted [X + ]) of interest to a user. Generally, the salt concentration of interest will correspond 
to salt conditions for a biological assay (e.g. a PCR or hybridization assay) of particular interest 
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to the user. In preferred embodiments of the invention, the salt concentration of interest will be a 
concentration of sodium ions. However, other monovalent cations (e.g., potassium, lithium, 
rubidium, cesium and francium) may be substituted for sodium. In many hybridization assays 
only monovalent cations are present. However, divalent cations such as magnesium may also be 
5 present. If divalent cations are present, melting temperatures may be determined using the 
methods described below for monovalent cations and then adjusted for divalent ion 
concentrations, e.g., using techniques that are already known in the art. See, for example, Ahsen 
et ai, Clinical Chemistry 2001, 47:1956-1961, see also Peyrot N. (2000) "Prediction of Nucleic 
Acid Hybridization: Parameters and Algorithm," Ph.D. Thesis, Wayne State University, Detroit, 
10 Michigan. 

As demonstrated in the Examples, infra, the methods of the invention are robust, 
and may be used reliably to determine melting temperatures for a wide range of different salt 
conditions. Preferred concentrations may be anywhere from about 5 mM to about 2 M, and are 
more preferably between about 50 mM and 1 M. In particularly preferred embodiments, a salt 
1 5 concentration of interest will be between about 70 and about 1 020 mM. However, using 

empirical techniques that are demonstrated in the below examples, a skilled artisan can readily 
optimize the formulas and methods of this invention for any salt concentration or range of salt 
concentrations of interest. Accordingly, the formulas and techniques described here need not be 
limited to the specific ranges of salt concentration used in those examples. 

20 

G-C content value. The invention provides methods and formulas which more 
accurately estimate salt effects on the melting temperature of a polynucleotide. In particular, 
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these methods adjust the "reference" melting temperature T m in a manner that is dependent upon 
the polynucleotide's sequence content, specifically the content of guanine (G) and cytosine (C) 
base pairs that form between a polynucleotide and its complement. Accordingly, the systems 
and methods of the invention also use a value, referred to herein as the "G-C content value" and 
5 denoted by the symbol/(G-C). The G-C content value/(G-C) provides a numerical value which 
is indicative of the number of G-C base pairs formed between a polynucleotide and its 
complementary sequence. In preferred embodiments, the G-C content of a polynucleotide may 
be obtained or provided from the molar fraction of G-C base pairs in the polynucleotide duplex; 
i.e.,f{G-C) = (no. G-C base pairs)/(total no. base pairs). 

10 

Estimating Salt Dependent Effects Of Melting Temperature (T„) : 

In accordance with the present invention, Applicants have discovered novel 
relationships between the melting temperature of a polynucleotide (T m ), the salt concentration 
[X + ] in which the polynucleotide dissociation (or hybridization) occurs, and the polynucleotide's 

1 5 G-C content value/(G-C). Accordingly, the invention provides novel methods for estimating 
melting temperatures using these novel relationships. Generally speaking, a "reference" melting 
temperature T m ° is obtained or provided for the polynucleotide at a "reference" salt concentration 
[X + ] 0 , as described above. The reference melting temperature is then used to calculate a salt- 
corrected T m according to a relationship that has been optimized for the polynucleotide's G-C 

20 content. 

For example, in one preferred embodiment, a salt-corrected melting temperature 
(T m ) may be estimated or obtained from a reference melting temperature (T m °) using the formula: 
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(Equation 5.1) 



In the present invention, the coefficient k is preferably a function of the polynucleotide's G-C 
content value; i.e., of/(G-C). It is noted that for Equations such as Equation 5.1, as well as for 
other equations throughout the specification based on the reciprocal of the melting temperature 
5 (i.e., \IT m ) temperatures should be entered in units of Kelvin. For equations provided in this 
application that involve T m (for example, Equation 5.2) units of Kelvin and degrees Celsius may 
be used interchangeably. Those skilled in the art will be able to readily convert between units of 
Kelvin and other scales for measuring temperature (e.g. degrees Celsius) using formulas that are 
well known and routinely used in the art (for example: K = °C + 273.15). 

10 In many embodiments, the relationship provided in Equation 5.1, supra, may be 

well approximated by a linear function of the reference melting temperature (T m °) rather than of 
its inverse (i.e., l/T m °). Such a relationship is less computationally intensive than Equation 5.1, 
and therefore will be simpler to use. Accordingly, the use of such a linear approximation may be 
preferred, particularly when considering the relatively narrow range of temperatures for which 

15 melting temperatures of nucleic acids are typically be considered; i.e., for physiological 
temperatures, preferably between about 20 and 80 °C (i.e., between about 293 and 353 K). 



temperature (T m ) may be estimated or obtained from a reference melting temperature (T m °) using 
the formula: 



Accordingly, in another preferred embodiment, a salt-corrected melting 



20 



T = T° + V x In 

m m 



(Equation 5.2) 
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which is a linear approximation of Equation 5 A, supra. Again, the coefficient k' is preferably a 
function of the polynucleotide's G-C content value; i.e., of/(G-C). 

The coefficient k (or k') is preferably obtained or provided by a formula of the 

general form: 

5 k = k(f(G - C)) = m • /(G - C) + £ 0 (Equation 5.3) 

In this equation, m and Icq are constant coefficients which may be optimized to 
determine melting temperatures for polynucleotides having different G-C content under the salt 
concentration(s) or range of salt concentrations of interest. For instance, the examples infra 
describe experiments when appropriate values for these coefficients are optimized for both of 

10 Equation 5.1 and 5.2 above, by optimizing the fit quality to melting data for a plurality of 
polynucleotide sequences. Those skilled in the art will appreciate that the exact value of the 
coefficients m and k 0 will depend on which formula (Equation 5.1 or 5.2) is used to estimate or 
obtain the salt-corrected melting temperature. Therefore, the coefficients are preferably 
optimized independently for each formula. 

15 Formulas for estimating or providing a salt-corrected melting temperature (e.g., 

Equations 5.1 and 5.2, above) may be further optimized by the addition of one or more higher 
order polynomial term. For example, and not by way of limitation, Equation 5.1 above may be 
readily modified by the addition of a second order polynomial term, to obtain a formula such as: 

j-= jz+ kx ln:r~+6x (ln 2 [X + ]- ln 2 [Z + ] 0 ) (Equation 5.4) 

mm L JO 

20 As before, the coefficient k is preferably a function of the polynucleotide's G-C content value 
that is optimized for evaluating melting temperatures at the particular salt concentration(s) of 
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interest. As described, supra, {e.g., for Equations 5.1 and 5.2), the coefficient k is preferably 
obtained or provided by a formula of the general form: 

k = k(f(G -C))=m- f(G -C) + k 0 (Equation 5.3) 

where, again, the coefficients m and fc 0 may be optimized to determine melting temperatures for 
5 polynucleotides under the salt concentration(s) or range of salt concentrations of interest. Again, 
the exact value of these coefficients should depend on which formula is used to estimate the salt- 
corrected melting temperature; i.e., the coefficients are preferably optimized independently for 
each formula. 

As another example, Equation 5.2 above may also be readily modified by the 
10 addition of a second order polynomial term, to obtain a formula such as: 

T m = t + *x ln^+ />x (ln 2 [X> ln 2 [^ + ] 0 ) (Equation 5.5) 

i A Jo 

As with the other formulas, the coefficient k is preferably a function of the polynucleotide's G-C 
content value that is optimized for evaluating melting temperatures at the particular salt 
concentration(s) or range of salt concentrations of interest; for example, by optimizing the 

15 coefficient m and k 0 in Equation 5.3, supra. Again, the coefficients are preferably optimized for 
each formula described in this application. 

Equations such as Equation 5.1 - 5.2 and 5.4 - 5.5 above may be still further 
refined by adding additional polynominal terms; e.g., beyond the second order polynomial terms 
exemplified in Equations 5.4 and 5.5. Thus, for example, embodiments of the invention are also 

20 contemplated that may use, e.g., a third order, forth order, and/or even a fifth order polynomial 
term. Those skilled in the art will be able to modify the equations used in this invention to 
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incorporate still higher order polynomial terms; e.g., using routine formulas and methods well 
known in the mathematical arts. 

Implementation Systems and Methods : 
5 Computer System. The analytical methods described herein can be implemented 

by the use of one or more computer systems. FIG. 2 schematically illustrates an exemplary 
computer system suitable for implementation of the analytical methods of this invention. The 
components of the computer system 201 include processor element 202 interconnected with a 
main memory 203. The computer system can other components such as a mass storage device 

10 204 and user interface devices 205 including for example, example, a monitor, a keyboard, 

and/or pointing devices 206 like a mouse or other graphical input device. The computer system 
201 can be linked to a network 207, which can be part of an Ethernet, a local computer system 
(e.g., as part of a local area network or LAN), and/or a wide area communication network 
(WAN) such as the Internet. 

1 5 Typically, one or more software components are loaded into main memory 203 

during operation of computer system 201. Software component 210 represents an operating 
system, which is responsible for managing computer system 201 and its network connections. 
Software component 211 represents common languages and functions in the system to assist 
programs implementing the methods specific to the invention. Equations for practicing the 

20 methods of the invention can also be programmed and implemented using any programmable 
spread sheet software program. Programmable database systems (for example, a SQL database) 
can be used to program and/or implement the equations and methods of this invention. Thus, 
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software component 212 represents the analytic methods of the invention as programmed in an 
appropriate procedural language, symbolic package, or the like. 

Computer Program Products. The invention also provides computer program 
5 products which can be used, e.g., to program or configure a computer system for implementation 
of analytical methods of the invention. A computer program product of the invention comprises 
a computer readable medium such as one or more compact disks {i.e., one or more "CDs", which 
may be CD-ROMs or a RW-CDs), one or more DVDs, one or more floppy disks (including, for 
example, one or more ZIP™ disks) or one or more DATs to name a few. The computer readable 

10 medium has encoded thereon, in computer readable form, one or more of the software 
components 212 that, when loaded into memory 203 of a computer system 201, cause the 
computer system to implement analytic methods of the invention. The computer readable 
medium may also have other software components encoded thereon in computer readable form. 
Such other software components may include, for example, functional languages 211 or an 

1 5 operating system 210. 

System Implementation. In an exemplary implementation, to practice the 
methods of the invention a G-C content value and/or cation concentration may be loaded into the 
computer system 201. For example, the G-C content value may be directly entered by a user 
20 from monitor and keyboard 205 by directly typing a sequence of symbols representing numbers 
{e.g., G-C content value). Alternatively, a user may specify a reference ion concentration, e.g., 
by selecting an ion concentration from a menu of candidate ion concentrations presented on the 
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monitor or by entering an accession number for a ion concentration in a database and the 
computer system may access the selected ion concentration from the database, e.g., by accessing 
a database in memory 203 or by accessing the sequence from a database over the network 
connection, e.g., over the internet. 
5 Finally, the software components of the computer system, when loaded into 

memory 203, preferably also cause the computer system to estimate a melting temperature 
according to the methods described herein. For example, the software components may cause 
the computer system to obtain a reference melting temperature at a particular reference ion 
concentration for the polynucleotide and then use the reference melting temperature to calculate 

1 0 a modified melting temperature for the polynucleotide utilizing the methods described herein. 

Upon implementing these analytic methods, the computer system preferably then 
outputs, e.g., the melting temperature for the polynucleotide at a desired ion concentration. The 
output may be output to the monitor, printed on a printer (not shown), written on mass storage 
204 or sent through a computer network (e.g., the internet or an intranet such as a Local Area 

1 5 Network) to one or more other computers. 

Alternative systems and methods for implementing the analytic methods of this 
invention are also intended to be comprehended within the accompanying claims. In particular, 
the accompanying claims are intended to include the alternative program structures for 
implementing the methods of this invention that will be readily apparent to those skilled in the 

20 relevant art(s). 



EXAMPLES 
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The present invention is also described by means of the following examples. 
However, the use of these or other examples anywhere in the specification is illustrative only and 
in no way limits the scope and meaning of the invention or of any exemplified term. Likewise, 
the invention is not limited to any particular preferred embodiments described herein. Indeed, 
5 many modifications and variations of the invention may be apparent to those skilled in the art 
upon reading this specification and can be made without departing from its spirit and scope. The 
invention is therefore to be limited only by the terms of the appended claims along with the full 
scope of equivalents to which the claims are entitled. 

10 EXAMPLE 1: 

Melting Temperatures of Various Oligomers Measured in Different Salt Conditions : 

This example describes experiments in which melting profiles were measured for 
at least 92 different, exemplary oligonucleotide molecules in various salt concentrations. 
1 5 Melting temperatures are extracted from those profiles for each oligonucleotide at each salt 
concentration observed, and those melting temperatures are provided in the results, infra. 
Sequence information for each of the exemplary oligonucleotides is also provided. 

Materials and Methods 
20 Oligonucleotide synthesis and purification. DNA oligonucleotides (SEQ ID 

NOS:l-92) were synthesized using solid phase phosphoramidite chemistry, deprotected and 

desalted on NAP-5 columns (Amersham Pharmacia Biotech, Piscataway, NJ) according to 

routine techniques (Caruthers et aL, Methods Enzymol 1992, 211 :3-20). The oligomers were 

purified using 20% polyacrylamide gel electrophoresis in lx TBE buffer (50 mM Tris, 50 mM 

-30- 

{M:\3988\100k297usl\00051867.DOC IDIIUIIHEIIMHII } 



boric acid, 1 mM Na 2 EDTA). The purity of each oligomer was determined by capillary 
electrophoresis (CE) carried out on a Beckman PACE 5000 (Beckman Coulter, Inc., Fullerton, 
CA). The CE capillaries had a 100 /xm inner diameter and contained ssDNA 100R Gel 
(Beckman-Coulter). Typically, about 0.6 nmole of oligonucleotide was injected into a capillary, 
5 ran in an electric field of 444 V/cm and detected by UV absorbance at 260 nm. Denaturing Tris- 
Borate-7 M-urea running buffer was purchased from Beckman-Coulter. 

Compound identity was verified by matrix-assisted laser desorption ionization 
time-of-light (MALDI-TOF) mass spectroscopy on a Voyager DE™ Biospectometry™ Work 
station (Applied Biosystems, Foster, CA) following the manufacturer's recommended protocol. 

10 

Preparation ofDNA samples. Melting experiments were carried out in buffer 
containing 3.87 mM NaH 2 P0 4 > 6.13 mM Na 2 HP0 4 , 1 mM Na 2 EDTA and either 50, 100, 200, 
600 or 1000 mM NaCl. 1 M NaOH was used to titrate each solution to pH 7.0. Total sodium 
concentrations were 68.9, 1 19, 220, 621 and 1020 mM. 

1 5 The DNA samples were thoroughly dialyzed against melting buffer in a 28- Well 

Microdialysis System (Invitation Corp., Carlsbad, CA) following the manufacturer's 
recommended protocol. Concentrations of DNA oligomers were estimated from the samples' 
UV absorbance at 260 nm in a spectrophotometer (Beckman Coulter, Inc., Fullerton, CA), using 
extinction coefficients for each oligonucleotide that were estimated using the nearest neighbor 

20 model for calculating extinction coefficients. (See, Warshaw et al., J. Mol Biol 1966, 20:29-38; 
See also, 

http://www.idtdnaxom/program/techbulletins/calculating_Molar_Extinction_Coefficient.asp). 
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Oligomer concentrations were estimated at least twice for each sample. If the estimated 
concentrations for any sample differed more than 4%, the results were discarded and new 
absorbance measurements were performed. 

To prepare oligonucleotide duplexes, complementary DNA oligomers were mixed 
5 in 1:1 molar ratio, heated to 367 K (i.e., 94 °C) and slowly cooled to an ambient temperature. 
Each solution of duplex DNA was diluted with melting buffer to a total DNA concentration (C T ) 
of2/xM, 

Measurement of melting curves. Melting experiments were conducted on a 
10 single beam Beckman DU 650 spectrophotometer (Beckman-Coulter) with a Micro T m Analysis 
accessory, a Beckman High Performance Peltier Controller (to regulate the temperature), and 
either 1 cm or 1 mm path-length cuvettes. Melt data were recorded using a PC interfaced to the 
spectrophotometer. UV-absorbance values at 268 nm wavelength were measured at 0.1 degree 
increments in the temperature range from 383 to 368 K (i.e., 10-95 °C). Both heating (i.e., 
15 "denaturation") and cooling (i.e., "renaturation") transition curves were recorded in each sample 
at a controlled rate of temperature change (24.9 ± 0.3 Kelvin per hour). Sample temperatures 
were collected from the internal probe located inside the Peltier holder, and recorded with each 
sample's UV-absorbance data. Melting profiles were also recorded for samples of buffer alone 
(no oligonucleotide), and these "blank" profiles were digitally subtracted from melting curves of 
20 the DNA samples. To minimize systematic errors, at least two melting curves were collected for 
each sample in different cuvettes and in different positions within the Peltier holder. 
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Determination of melting temperatures. To determine each sample's melting 
temperature, the melting profiles were analyzed using methods that have been previously 
described (see, Doktycz et ai, Biopolymers 1992, 32:849-864; Owczarzy et ai, Biopolymers 
1997, 44:217-239). Briefly, the experimental data for each sample was smoothed, using a digital 
5 filter, to obtain a plot of the sample's UV-absorbance as a function of its temperature. The 

fraction of single-stranded oligonucleotide molecules, #, was then calculated from that plot. The 
"melting temperature" or u T m H of a sample was defined as the temperature where d = 0.5. 

Results: 

10 To evaluate the effects of changing salt concentration on an oligonucleotide's 

melting temperature (T m ), a plurality of different oligonucleotide molecules (also referred to as 
"oligomers") were synthesized and melting profiles were obtained for each oligomer under 
different salt conditions. The term "melting profile" refers to a collection of measurements of an 
oligonucleotide and its complement which indicate the oligonucleotide molecule's transition 

1 5 from double-stranded to single-stranded nucleic acid (or vice-versa). The transition of a nucleic 
acid from double-stranded to single-stranded is often described in the art as the "melting" of that 
nucleic acid molecule. The transition may also be described as the "denaturation" or 
"dissociation" of the nucleic acid. Accordingly, a melting profile of the present invention may 
also be referred to as a "dissociation profile", a "denaturation profile", a "melting curve", a 

20 "dissociation curve", etc. 

It is well known in the art that a sample of double-stranded nucleic acid molecules 
will absorb less UV-light than an equivalent sample of single-stranded nucleic acid molecules. 
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Thus, in one preferred embodiment a melting profile may comprise a collection of measurements 
indicating the UV absorption of a nucleic acid sample over a range of temperatures. Such a 
collection of measurements was obtained for the melting profiles in this Example, following the 
procedures described in Section 6.1.1, supra. In such a melting profile, an increase in UV- 
5 absorption as the temperature increases will indicate the extent to which more and more base 
pairs of duplex nucleic acid molecules in the sample are dissociating and an increasing fraction, 
0, of those molecules are present in a single-stranded conformation. Conversely, a decrease in 
UV-absorption as the temperature decreases indicates that more and more base pairs are forming 
in the sample so that the fraction of double stranded nucleic acid molecules (1 - d) in the sample 

10 is increasing while the fraction of single-stranded nucleic acid molecules (6) is decreasing. 

The ''melting temperature" or 4 T m " of a nucleic acid molecule refers to the 
temperature above which the nucleic acid may generally be regarded as existing in a single- 
stranded conformation and below which the nucleic acid may generally be regarded as existing 
in a double-stranded confirmation. It is well appreciated in the art that the transition from 

1 5 double-stranded to single-stranded nucleic acid molecules does not occur at a single temperature 
but, rather, occurs over a range of temperatures (e.g., typically a range between about 5 and 
15 °C ). Nevertheless, the T m provides a convenient measurement for approximating whether 
nucleic acid molecule in a sample exist in a single-stranded or double-stranded conformation. 

As an example, FIG 1 A shows an exemplary "UV -melting curve' 5 for a 2 \iM 

20 solution of the oligonucleotide 5 '-TAACCATACTGAATACCTTTTGACG-3 ' (SEQ ID NO:45) 
and its complement dissolved in 68.9 mM Na + buffer. This "melting curve" was obtained as 
described in the Material and Method section, supra. Because the solution absorbs more UV- 
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light (260 nm) when the nucleic acid molecules are in a single-stranded conformation then when 
they are in a double-stranded conformation, the UV-melting curve in FIG. 1 A actually monitors 
the oligonucleotide's transition from the double-stranded to the single-stranded conformation. 
Inspection of the UV-melting curve reveals that the transition from double to single-stranded 
5 conformation does not occur completely at a single temperature, but rather takes place across a 
range of temperatures. However, this range is very narrow (e.g., between about 5-15 °C). 
Thus, at temperatures above the center of this transition (e.g., above about 56.5 °C or 329.7 K) 
the oligonucleotides in this sample can generally be regarded as existing in a single-stranded 
conformation, whereas at temperatures below that "melting temperature" the oligonucleotides in 

10 the sample are generally regarded as existing in a double-stranded conformation (i.e., as 
"duplex" oligonucleotide). 

FIG. IB shows data from a differential scanning calorimetry (DSC) experiment 
for a sample of the same oligonucleotide at much higher concentrations (90 solution of SEQ 
ID NO: 45, Q = 180 |iM, in 68.9 mM Na + buffer). For a detailed description of this 

15 experimental technique, see, e.g., Cooper, Curr. Opinion Chem. Biol, 1999, 3:557-563; and 
Plum & Breslauer, Curr. Opinion Struct Biol, 1995, 5:682-690. 

The plot in FIG. IB shows the sample's excess heat capacity (AC P ) as the 
temperature is raised from about 293 to about 363 K (i.e., 20 °C to about 90 °C). Heat capacity 
of the sample increases as the oligonucleotide duplexes in that sample undergo a transition from 

20 the double-stranded conformation to the single-stranded conformation. Again, inspection of this 
figure shows that the transition occurs across a finite but narrow range (e.g., about 5-15 
degrees) of temperatures centered at approximately 63 °C, where the heat absorption is maximal. 
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Thus, again, at temperatures above about 63 °C (336 K) the oligonucleotides within this sample 
can generally be regarded as existing in a single-stranded conformation, whereas the 
oligonucleotides may be generally regarded as existing in a double-stranded conformation at 
temperatures below about 63 °C. 
5 The observation that this transition (from double-stranded to single-stranded 

DNA) occurs at a higher temperature for the sample in FIG, IB (approximately 63 °C) than for 
the sample in FIG. 1 A (56.5 °C) may be readily attributed to the higher oligonucleotide 
concentration in FIG. IB (180 )iM vs. 2 juM) which, as is well known in the art, drives the 
equilibrium towards the double-stranded nucleic acid conformation. 

10 Generally, the T m of a nucleic acid may be defined as the temperature at which 

one-half of the Watson-Crick base pairs in the double-stranded nucleic acid molecules are 
broken or dissociated (i.e., they are "melted") while the other half of the Watson-Crick base pairs 
remains intact; i.e., the fraction of broken base pairs, 9(T) = 0.5 when T = T m . However, in many 
cases the transition of oligonucleotides and other nucleic acids considered in this invention is or 

1 5 may be considered as a "two-state" process in which an individual nucleic acid molecule exists 
either as a duplex nucleic acid with all of its base-pairs intact, or as a single-stranded nucleic acid 
with no Watson-Crick base pairs between it and its complementary sequence. The fraction of 
molecules at any given temperature that are partially melted is negligibly small. Accordingly, 
the fraction 0(T) may be considered as the fraction of single-stranded nucleic acid molecules in a 

20 sample of interest, and 1 - 0(T) is considered as the fraction of double-stranded nucleic acid 

molecules in that sample.. In such embodiments, the T m may be equivalently defined as the 

temperature at which one-half of the nucleic acid molecules are in a single-stranded 
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conformation and one-half of the nucleic acid molecules are in a double-stranded conformation. 
The exemplary melting and DSC curves shown in FIGS. 1A and IB, respectively, show that the 
transition of that oligonucleotide (SEQ ID NO:45) is a two-state process, with only a single 
transition between two conformation states: double-stranded and single-stranded DNA. The T m 
5 is the temperature at the midpoint of that transition. Thus, the melting temperature of a nucleic 
acid sample may be readily obtained by evaluating a melting profile for that sample, such as the 
UV-melting curve illustrated in FIG. 1 A or the DSC curve shown in FIG. IB. See, also, the 
Materials and Methods Section, supra. 

Oligonucleotides corresponding to each of the sequences set forth in SEQ ID 

10 NOS:l-92 and their complementary sequences were synthesized and purified according to the 
methods described in the Materials and Methods section, supra. Capillary electrophoresis assays 
confirmed that all samples were more than 90% pure, and experimental molar masses of each 
oligomer were confirmed within 0.4% of their predicted molar mass by mass spectroscopy. For 
the melting experiments, each of the oligonucleotides listed in Table 1, below (SEQ ID NOS: 1- 

1 5 92) was mixed in a 1 : 1 molar ratio with its 100% complementary sequence, as described in 

Material and Methods Section, supra. Melting profiles were then recorded for each oligomer in 
68.9, 1 19, 220, 621 and 1020 mM Na + , and the melting temperature extracted from each profile. 
The experimentally determined T m values for each sample were reproducible within 0.3 C. 

The T m values obtained for each oligomer are provided in Table I, below. For 

20 convenience, the melting temperatures specified in Table I are listed in units of Kelvin (K), 

which are preferably used in the implementation of this invention. However, those skilled in the 
art will be able to readily convert between units of Kelvin and other scales or units for measuring 
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temperature (e.g., degrees Celsius) using formulas that are well known and routinely used in the 
art (for examples K = °C + 273.15). Sequence information was also recorded for each oligomer, 
including the total number of bases (N) and the G-C content. Specifically, an oligomer's G-C 
content/(G-C) is defined here as the fraction of bases that are either guanine or cytosine. Thus, 
5 for example, the oligonucleotide set forth in SEQ ID NO:l comprises a total of 15 bases pairs 
(i.e., N = 15), of which three are either guanine or cytosine. Thus, that particular oligomer's G-C 
content may be obtained or provided by: /(G-C) = 3/15 = 0,2, The nucleotide sequence, total 
number of base pairs and G-C content for each oligomer are also provided in Table I, along with 
the corresponding SEQ ID NO. 

10 

TABLE I: MEASURED MELTING TEMPERATURES AT VARIOUS [Na + ] 
CONCENTRATIONS 



SEQ ID Sequence (Total Na + Concentration) 

NO. N /(G-C) 68.9 mM 119mM 220 mM 621 mM 1020 mM 



1 


TACTAACATTAACTA 


15 


0.20 


308.5 


313.7 


317.3 


322.5 


324.3 


2 


ATACTTACTGATTAG 


15 


0.27 


311.3 


314.7 


318.2 


323.1 


324.7 


3 


GTACACTGTCTTATA 


15 


0.33 


314.2 


318.0 


321.5 


326.1 


328.0 


4 


GTATGAGAGACTTTA 


15 


0.33 


313.1 


317.4 


321.1 


326.5 


328.6 


5 


TTCTACCTATGTGAT 


15 


0.33 


313.8 


317.8 


321.3 


325.4 


326.9 


6 


AGTAGTAATCACACC 


15 


0.40 


317.5 


321.0 


324.8 


329.4 


330.3 


7 


ATCGTCTCGGTATAA 


15 


0.40 


318.7 


322.6 


326.1 


330.6 


331.8 


8 


ACGACAGGTTTACCA 


15 


0.47 


321.0 


324.4 


328.7 


333.0 


334.5 


9 


CTTTCATGTCCGCAT 


15 


0.47 


323.2 


327.1 


330.3 


334.6 


335.9 


10 


TGGATGTGTGAACAC 


15 


0.47 


319.7 


324.8 


327.8 


332.3 


333.5 


11 


ACCCCGCAATACATG 


15 


0.53 


324.52 


328.5 


331.7 


335.6 


336.1 


12 


GCAGTGGATGTGAGA 


15 


0.53 


324.4 


328.0 


331.2 


334.9 


336.5 


13 


GGTCCTTACTTGGTG 


15 


0.53 


321.0 


324.8 


328.3 


332.3 


333.5 
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TABLE I: MEASURED MELTING TEMPERATURES AT VARIOUS [Na 4 ] 
CONCENTRATIONS 



SEQ ID Sequence (Total Na + Concentration) 



NO. 




N 


A.G-C) 


68.9 mM 


119mM 


220 mM 


621 mM 


1020 mM 


14 


CGCCTCATGCTCATC 


15 


0.60 


326.0 


329.9 


333.3 


336.8 


339.0 


15 


AAATAGCCGGGCCGC 


15 


0.67 


332.2 


335.4 


338.5 


342.2 


343.6 


16 


CCAGCCAGTCTCTCC 


15 


0.67 


327.3 


331.2 


334.7 


338.3 


339.9 


17 


GACGACAAGACCGCG 


15 


0.67 


331.1 


334.7 


337.6 


340.8 


341.8 


18 


CAGCCTCGTCGCAGC 


15 


0.73 


334.0 


337.3 


340.6 


343.3 


345.2 


19 


CTCGCGGTCGAAGCG 


15 


0.73 


334.7 


337.8 


340.3 


343.2 


343.9 


20 


GCGTCGGTCCGGGCT 


15 


0.80 


338.1 


340.9 


343.7 


347.1 


347.3 


21 


TATGTATATTTTGTAATCAG 


20 


0.20 


317.6 


320.9 


325.7 


330.8 


334.4 


22 


TTC A A G TT A A A C A TTCT A TC 


20 


0.25 


318.9 


322.7 


327.1 


332.6 


334.7 


23 


TG A TTCT A CCT A TG TG A TTT 


20 


0.30 


322.3 


326.7 


330.6 


335.5 


337.6 


24 


GAGATTGTTTCCCTTTCAAA 


20 


0.35 


322.5 


326.0 


330.8 


335.8 


338.5 


25 


ATGCAATGCTACATATTCGC 


20 


0.40 


328.4 


332.7 


336.1 


340.2 


342.1 


26 


CCACTATACCATCTATGTAC 


20 


0.40 


324.3 


327.8 


331.6 


335.4 


337.6 


27 


CCATCATTGTGTCTACCTCA 


20 


0.45 


328.8 


332.7 


336.4 


340.5 


341.7 


28 


CGGGACCAACTAAAGGAAAT 


20 


0.45 


326.9 


331.0 


334.9 


339.9 


341.7 


29 


TAGTGGCGATTAGATTCTGC 


20 


0.45 


330.2 


333.8 


338.0 


342.4 


344.4 


30 


AGCTGCAGTGGATGTGAGAA 


20 


0.50 


332.9 


336.7 


340.8 


344.5 


346.3 


31 


TACTTCCAGTGCTCAGCGTA 


20 


0.50 


333.5 


337.6 


340.9 


344.8 


346.8 


32 


CAGTGAGACAGCAATGGTCG 


20 


0.55 


333.0 


336.7 


340.2 


344.3 


345.7 


33 


CG AGCTT ATCCCT ATCCCTC 


20 


0.55 


329.2 


333.5 


337.3 


341.5 


343.5 


34 


CGTACTAGCGTTGGTCATGG 


20 


0.55 


332.8 


336.3 


339.8 


343.7 


344.3 


35 


AA(j(j(_OA(j 1 LAOOL 1 CAu 1 U 


20 


0.60 


337.7 


340.9 


344.6 


348.3 


349.5 


36 


ACCGACGACGCTGATCCGAT ] 


20 


0.60 


339.2 


342.3 


345.8 


350.0 


350.5 


37 


AGCAGTCCGCCACACCCTGA 


20 


0.65 


339.7 


343.1 


347.2 


350.1 


351.7 


38 


CAGCCTCGTTCGCACAGCCC 


20 


0.70 


340.4 


343.9 


347.2 


350.9 


351.4 


39 


GTGGTGGGCCGTGCGCTCTG j 


20 


0.75 


342.4 


345.9 


349.4 


352.8 


354.2 


40 


GTCCACGCCCGGTGCGACGG 


20 


0.80 


344.1 


347.1 


350.5 


353.0 


354.3 
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TABLE I: MEASURED MELTING TEMPERATURES AT VARIOUS [Na + ] 
CONCENTRATIONS 

^ ' ~ " ' ~ " Tm (K) 



SEQ ID Sequence (Total Na + Concentration) 



NO. 




N 




68.9 mM 


119mM 


220 mM 


621 mM 


1020 mM 


41 


GATATAGCAAAATTCTAAGTTAATA 


25 


0.20 


322.3 


326.7 


330.9 


336.5 


339.4 


42 


ATAACTTTACGTGTGTGACCTATTA 


25 


0.32 


329.8 


333.9 


337.9 


342.8 


345.0 


43 


GTTCTATACTCTTGAAGTTGATTAC 


25 


0.32 


325.9 


329.3 


333.8 


339.3 


340.9 


44 


CCCTGC ACTTT AACTG A A TTGTTT A 


25 


0.36 


330.7 


334.6 


338.8 


343.3 


345.7 


45 


TAACCATACTGAATACCTTTTGACG 


25 


0.36 


329.7 


333.4 


337.5 


342.1 


344.5 


46 


TCCACACGGTAGTAAAATTAGGCTT 


25 


0.40 


332.5 


336.3 


340.5 


345.0 


347.0 


47 


TTCCAAAAGGAGTTATGAGTTGCGA 


25 


0.40 


332.3 


336.2 


340.4 


344.8 


347.0 


48 


AATATCTCTCATGCGCCAAGCTACA 


25 


0.44 


335.3 


338.9 


343.5 


348.2 


349.7 


49 


TAGTATATCGCAGCATCATACAGGC 


25 


0.44 


334.4 


337.9 


342.3 


346.0 


348.2 


50 


TGG A TTCT A CTCA A CCTT AGTCTGG 


25 


0.44 


332.2 


336.3 


340.3 


344.5 


346.8 


51 


CGGAATCCATGTTACTTCGGCTATC 


25 


0.48 


333.9 


337.9 


341.9 


346.5 


347.9 


52 


CTGGTCTGGATCTGAGAACTTCAGG 


25 


0.52 


335.3 


339.0 


342.8 


347.4 


348.8 


53 


ACAGCGAATGGACCTACGTGGCCTT 


25 


0.56 


341.3 


345.3 


349.2 


352.6 


354.3 


54 


AGCAAGTCGAGCAGGGCCTACGTTT 


25 


0.56 


341.5 


345.8 


349.5 


353.2 


354.7 


55 


GCGAGCGACAGGTTACTTGGCTGAT 


25 


0.56 


340.2 


344.0 


347.9 


351.8 


353.3 


56 


AAAGGTGTCGCGGAGAGTCGTGCTG 


25 


0.60 


342.8 


346.8 


350.6 


354.4 


355.6 


57 


ATGGGTGGGAGCCTCGGTAGCAGCC 


25 


0.68 


343.9 


347.7 


351.4 


354.8 


356.6 


58 


CAGTGGGCTCCTGGGCGTGCTGGTC 


25 


0.72 


345.1 


348.8 


352.3 


355.8 


356.5 


59 


GCCAACTCCGTCGCCGTTCGTGCGC 


25 


0.72 


346.8 


349.6 


353.8 


356.4 


357.8 


60 


ACGGGTCCCCGCACCGCACCGCCAG 


25 


0.80 


350.0 


353.1 


357.2 


360.3 


361.5 


61 


TTATGTATTAAGTTATATAGTAGTAGTA 
GT 


30 


0.20 


323.9 


328.2 


332.5 


338.3 


339.8 


62 


A TTC. ATA TfTTTTTrT A TTf A T^TTTr A T 

A 1 IUAI Al I 1 1 ILIAI 1 L-A i C 1 J I 1 

T 


30 


0.23 


326.7 


331.5 


335.5 


341.8 


343.6 


63 


AAAGTACATCAACATAGAGAATTGCAT 
TTC 


30 


0.30 


331.5 


335.1 


339.3 


344.5 


346.4 


64 


CTTAAGATATGAGAACTTCAACTAATGT 
GT 


30 


0.30 


330.0 


334.5 


338.1 


343.7 


345.0 


65 


CTCAACTTGCGGTAAATAAATCGCTTAA 
TC 


30 


0.37 


334.0 


338.0 


341.9 


347.6 


348.7 


66 


TATTGAGAACAAGTGTCCGATTAGCAG 
AAA 


30 


0.37 


334.5 


338.6 


342.8 


348.0 


349.6 


67 


GTCATACGACTGAGTGCAACATTGTTCA 
AA 


30 


0.40 


335.9 


340.0 


344.0 


349.1 


350.1 
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TABLE I: MEASURED MELTING TEMPERATURES AT VARIOUS [Na + ] 
CONCENTRATIONS 

" " ~ ~~ r m (K) 



SEQ ID Sequence (Total Na + Concentration) 

NO. N y(G-C) 68.9 mM U9mM 220 mM 621 mM 1020 mM 



68 


AACCTGCAACATGGAGTTTTTGTCTCAT 
GC 


30 


0.43 


337.7 


341.5 


345.7 


350.9 


351.9 


69 


CCGTGCGGTGTGTACGTTTTATTCATCA 
TA 


30 


0.43 


337.1 


341.5 


345.0 


349.7 


350.8 


70 


GTTCACGTCCGAAAGCTCGAAAAAGGA 
TAC 


30 


0.47 


337.5 


341.4 


345.3 


350.4 


351.9 


71 


AGTCTGGTCTGGATCTGAGAACTTCAG 
GCT 


30 


0.50 


339.5 


343.6 


347.7 


352.0 


353.8 


72 


TCGGAGAAATCACTGAGCTGCCTGAGA 
AGA 


30 


0.50 


339.5 


343.6 


347.3 


352.2 


354.2 


73 


CTTCAACGGATCAGGTAGGACTGTGGT 
GGG 


30 


0.57 


340.8 


344.9 


347.9 


352.1 


353.3 


74 


ACGCCCACAGGATTAGGCTGGCCCACA 
TTG 


30 


0.60 


344.5 


347.9 


351.7 


355.9 


357.2 


75 


GTTATTCCGCAGTCCGATGGCAGCAGG 
CTC 


30 


0.60 


343.8 


348.1 


351.3 


355.6 


357.3 


76 


TCAGTAGGCGTGACGCAGAGCTGGCGA 
TGG 


30 


0.63 


345.4 


348.9 


352.5 


356.5 


357.8 


77 


CGCGCCACGTGTGATCTACAGCCGTTC 
GGC 


30 


0.67 


345.9 


349.5 


352.7 


356.6 


357.7 


78 


GACCTGACGTGGACCGCTCCTGGGCGT 
GGT 


30 


0.70 


347.7 


351.6 


354.7 


358.4 


359.5 


79 


GCCCCTCCACTGGCCGACGGCAGCAGG 
CTC 


30 


0.77 


349.5 


353.0 


356.8 


360.3 


360.9 


80 


CGCCGCTGCCGACTGGAGGAGCGCGGG 
ACG 


30 


0.80 


351.0 


354.8 


357.8 


360.9 


361.8 


81 


ATCAATCATA 


10 


0.20 


294.5 


297.7 


301.1 


305.6 


306.8 


82 


TTGTAGTCAT 


10 


0.30 


297.8 


301.0 


304.4 


308.0 


309.2 


83 


GAAATGAAAG 


10 


0.30 


295.3 


298.5 


302.3 


306.3 


307 6 


84 


CCAACTTCTT 


10 


0.40 


302.2 


305.3 


309.1 


312.8 


313.8 


85 


ATCGTCTGGA 


10 


0.50 


307.0 


310.6 


313.7 


317 7 


318 1 


86 


AGCGTAAGTC 


10 


0.50 


300.6 


304.3 


307.8 


312.7 


313.4 


87 


CGATCTGCGA 


10 


0.60 


312.4 


315.5 


318.8 


321.6 


322.3 


88 


TGGCGAGCAC 


10 


0.70 


317.6 


321.1 


324.5 


328.2 


328.5 


89 


GATGCGCTCG 


10 


0.70 


317.4 


320.2 


323.3 


326.8 


326.7 


90 


GGGACCGCCT 


10 


0.80 


319.9 


323.5 


326.3 


329.7 


330.2 


91 


CGTACACATGC 


11 


0.55 


313.5 


316.7 


319.3 


322.8 


323.1 


92 


CCATTGCTACC 


11 


0.55 


311.3 


314.9 


317.7 


321.1 


322.1 
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EXAMPLE 2: 

Sequence Dependent Salt Effects on T m 

As an initial evaluation of the effect(s) sequence composition and length may 

have on a nucleic acid's melting temperature T m , the experimentally determined melting 
temperatures for each oligonucleotide in Table I, supra, were fit in a least squares analysis to 
each of the following linear regressions: 

T m = T l + w log[Afa + ] (Equation 6.1) 

— = -~ + m \og[Na* ] (Equation 6.2) 

Equations that are similar in form to Equations 6.1 and 6.2 are well known in the 
art and have been used to predict nucleic acid melting temperatures for specified salt 
concentrations. However, the versions of these formulas that have been previously described use 
coefficients (i.e., for m) that are constants and are independent of (and therefore unaffected by) 
either the nucleic acid's length or base composition. For example, Schildraut et al (Biopolymers 
1965, 3:195-208) describe an equation that is similar to Equation 6.1, above, for estimating a 
nucleic acid's melting temperature as a function of sodium ion concentration: 

T m (Na + ) = T m (lM) + 16.61og[Na + ] (Equation 6.3) 

See also, Rychlik et al, Nucl Acids Res. 1990, 18:6409; Ivanov & AbouHaidar, Analytical 

Biochemistry 1995, 232:249-251; and Wetmur, Critical Review in Biochemistry and Molecular 

Biology 1991, 26:227-259. In the above Equation 6.3, the oligomer's melting temperature in 1 M 
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sodium salt is used as the reference melting temperature (i.e., T Yt 



o 



= r m (lM) ). This reference 



m 



temperature may be measured (e.g., as described above) or it may be calculated or predicted, 
e.g., by the nearest neighbor model of Breslauer et al, Proc. Natl. Acad. Sci. U.S.A. 1986, 
83:3746-3750. See also, SantaLucia et al, Biochemistry 1996, 35:3555-3562; and Santa Lucia, 
5 Proc. Natl. Acad. Sci. U.S.A. 1998, 95:1460-1465. 



value of m = 12.5 rather than 16.6. (See, SantaLucia et al, Biochemistry 1996, 35:3555-3562). 



dissociation, AS 0 , to changing sodium ion concentrations has also been proposed (See, Santa 
10 Lucia, Proc. Natl. Acad Sci U.S.A., 1998, 95:1460-1465): 



where N is the total number of phosphates in the nucleotide duplex divided by two and AS° (1M) 
15 is the entropy of dissociation in 1M Na + , which is preferably calculated using an appropriate 
nearest neighbor model (described above). Equations are also well known in the art that relate 
the melting temperature, T m , to the enthalpy and entropy of oligonucleotide dissociation (AH° 
and AS°, respectively) and the total oligonucleotide strand concentration, C T (for example, see 
Santa Lucia, supra): 



Others have used a formula that is similar to Equation 6.3, above, but using a 



Alternatively, an equation that relates the entropy change of oligonucleotide 



AS 0 (Na + ) = AS° (1M) + 0.368Nln [Na + ] 



(Equation 6.4) 



20 



AH 0 



(Equation 6.5) 



(AS Q +R\nC T /4) 
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If it is assumed that the enthalpy of polynucleotide dissociation does not vary in 
changing salt conditions, then Equations 6.4 and 6.5 above may be readily combined to derive an 
alternative equation relating T m to salt concentration: 

1 1 0.368^/ [No] 

— = — + — x In (Equation 6.6) 

T n T° m A// 0 [Na + ] 0 

5 where standard transition enthalpy, AH°, is preferably calculated using an appropriate nearest 
neighbor model. 

None of the equations described above accounts for any content of the particular 
polynucleotide sequence under consideration. Indeed, existing models of polynucleotide 
dissociation assume that cations stabilize polynucleotide duplexes by partially neutralizing the 
10 negative charges of each strand's phosphate background. (See, Schildkraut et aL, Biopolymers 
1965, 3:195-208; and SantaLucia et aL, Proc. Natl Acad. Sci 1998, 95:1460-1465). Such 
models suggest that any salt effect on polynucleotide dissociation would be sequence 
independent. 

Applicants have determined, however, that the effect(s) of salt concentration on 
1 5 the melting temperature of a nucleic acid is itself dependent on the nucleotide sequence. This 
discovery is readily apparent from the preliminary analysis described in this example. A 
coefficient value m was obtained for each oligomer in Table I when its measured melting 
temperatures are fit to Equation 6.1. The coefficients m thus obtained for each oligonucleotide 
are plotted in FIG. 3, as a function of the oligomer's G-C content /(G-C). If the effects of 
20 changing salt concentration on the melting temperature were in fact sequence independent, as 
suggested by the prior art and by Equation 6.3, the same coefficient m would be obtained for all 
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oligomers and the data plotted in FIG. 3 would form a horizontal line. However, it is apparent 
from inspection of FIG. 3 that the coefficient m actually decreases as the G-C content increases. 

Data points for oligomers of 15, 20, 25 and 30 base pairs in length are distinctly 
labeled in FIG. 3. The coefficient m decreases at a similar rate for oligomers of all these 
5 different lengths. Thus, these data demonstrate that while the effect of changing salt 

concentration depends upon the sequence composition of a nucleic acid, it is substantially 
independent of the nucleic acid's length. 

FIG. 4 shows, as a function of the G-C content J{G-C), the coefficient m obtained 
for each oligomer in Table II when its measured melting temperatures are fit to Equation 6.2, 

10 supra. Again, if the effects of changing salt concentration on melting temperature were actually 
sequence independent (as suggested by formulas used in the prior art) the data plotted in this 
figure would form a horizontal line. In fact, however, the coefficient m increases linearly with 
the oligomers' G-C content. As in FIG. 3, the coefficient m changes at a similar rate with G-C 
content for oligomers of different length. Thus, FIG. 4 also demonstrates that, while the effect 

15 of changing salt concentration depends upon the sequence composition of a nucleic acid, the 
effect is substantially independent of the nucleic acid's length. 

EXAMPLE 3: 

Formulas for Improved Prediction of T m for Different Salt Conditions 

20 To evaluate the effect(s) that differing salt concentrations may have on an 

oligonucleotide's melting temperature, the experimentally determined melting temperatures set 
forth in Table I, supra, were fit to various different equations that predict melting temperature 
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based on: (1) salt concentration (in this example, [Na ]), and (2) oligonucleotide sequence 
content (e.g.,J{G-C) ). Generally speaking, the equations were of one of the following forms: 

T m = C + k(f(G- Q)x In— — + bx ln 2 [Mr + ] (1) - \n 2 [Na*\J (Equation 6.7) 

L yv " J(0) 

J=Jt+ k(f(G- C))x In— — -+bx (\n 2 [Na + ]~ ln 2 [Na + \ 0) ) (Equation 6.8) 

mm L -K0) 

In Equations 6.7 and 6.8, above, T m denotes a melting temperature to be 
5 determined for a nucleic acid in a particular concentration of monovalent cations, [Na + ]. T m ° 
denotes the nucleic acid's melting temperature in some "reference" concentration of the 
monovalent cations [Na + ] (0 ). Typically, a value of 1 M is selected as the reference concentration 
(i.e., [Na + ] (0 ) = 1 M). However, any value may be selected for a reference concentration and a 
skilled artisan practicing this invention will readily appreciate when and what other reference 

1 0 concentration values may be used. 

The reference melting temperature, T m ° may be a melting temperature that is 
experimentally determined (e.g., as described in Section 6.1, above) for the nucleic acid at the 
reference concentration of cations, [Na + ] (0 ). However, the value of T m ° may also be calculated 
using methods, such as the nearest neighbor model, that are well known and routinely used in the 

15 art to determine a nucleic acid's melting temperature at some concentration of cations. For 
instance, the reference melting temperature T m ° may be calculated using a nearest neighbor 
model as described, e.g., by SantaLucia et aL, Proc. Natl. Acad. Set U.S.A. 1998, 95:1460; or, 
less preferably, Breslauer et al.Proc. Natl Acad. Sci. U.S.A. 1986, 83:3746-3750 (see also, the 
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references cited supra in connection with the nearest neighbor model). Preferably, both the 
predicted melting temperature T m and the reference melting temperature T m ° are specified in 
Kelvin (K). 

In embodiments where a reference melting temperature is calculated from a 
5 theoretical model, the parameters of that model will typically have been calibrated, optimized or 
otherwise selected for a particular concentration of cations (e.g., for 1 M Na + ). A skilled artisan 
practicing the invention will appreciate, therefore, that the reference concentration of cations 
used in such embodiments (i.e. the value [Na + ] (0 ) in Equations 6.7 and 6.8, supra) will preferably 
be that value for which the theoretical model's parameters have been evaluated. 

10 In the present example and in most preferred embodiments of the invention, the 

monovalent cations are sodium cations. However, use of the symbol for sodium cations (i.e., 
Na + ) in the above equations is done here merely to simplify the description of this invention. 
The formulas presented here, as well as the algorithms they represent and illustrate, may be used 
to estimate or predict melting temperature in different concentrations of any monovalent cation, 

15 including lithium cations (Li + ), potassium cations (K + ), rubidium cations (Rb + ), cesium cations 
(Cs + ) and francium cations (Fr + ) to name a few. 

As demonstrated in Example 2, above, the effect of salt concentration on the 
melting temperature of a nucleic acid is itself dependent on nucleotide sequence. Moreover, 
Applicants have determined, as demonstrated here, that such sequence-dependent effects may be 

20 accounted for when predicting or estimating T m values, by simply using a coefficient k which is a 
function of the nucleotide sequence content. In particular and in preferred embodiments of the 
invention, the coefficient k may be a function of the nucleic acid's G-C content /(G-C). 
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As used herein to describe the present invention, the G-C content of a nucleic 
acid,/(G-C), refers to the fraction of that nucleic acid's nucleotide bases that are either guanine 
(G) or cytosine (C). Thus, for example, the oligomer set forth in SEQ ID NO:l (see, also, in 
Table I, supra) comprises a total of 15 bases (i.e., n = 15), of which three are either guanine or 
5 cytosine. Thus, that oligomer's G-C content may be obtained or provided by: 
/(G-C) = 3/1 5 -0.2. 

The second term in Equations 6.7 and 6.8 (i.e., the term: b x (ln 2 [Na + ] ( i) - 
ln 2 [Na + ](o)) ) is an exemplary second-order polynominal term. In preferred embodiments, the 
use of this term with an appropriately selected coefficient value b provides for greater accuracy 
10 and greater reliability when estimating the melting temperature for a nucleic acid under 
particular salt conditions. 

In still other embodiments, additional higher order polynomial terms may also be 
used in Equations 6.7 - 6.8, to estimate salt-corrected melting temperatures with even greater 
accuracy and reliability. Thus, the invention also contemplates the optional use of third, forth 
1 5 and/or even fifth order polynomial terms. Those skilled in the art will be able to modify the 

equations used in this invention to incorporate such higher order polynomial terms using routine 
formulas and methods well known in the mathematical arts. 

Those skilled in the art will also recognize that, when higher order polynominal 
terms are used in these equations, it will be necessary to re-optimize the coefficients for optimal 
20 results. Thus, for example, when the second order polynominal term is used in Equation 6.7 

and/or 6.8 (i.e., b ?*0) it may be necessary to select a value for the coefficient k which, generally, 
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will be different from the value selected when such higher order polynominal terms are not used 
(i.e., when b = 0). 

It is also noted that the formulas provided in Equations 6.7 and 6.8 are set forth 
with respect to the "natural logarithm" (i.e., a logarithm of the base e = 2.1718) of a cation 
5 concentration or of a ratio of cation concentrations. As a skilled user will readily appreciate, it 
may be preferable in many instances to perform calculations using logarithms of a different base 
{e.g., the logarithm of base 10 or of base 2) which may, for example, be simpler to calculate. 
The logarithmic terms in Equations 6.7 and 6.8, as well as in the other formulas and equations set 
forth in this document, may be readily adapted to such other forms by simply making an 

10 appropriate adjustment to the coefficient(s); more specifically by multiplying the coefficient(s) 
by an appropriate factor. Those skilled in the art will be able to readily obtain or determine the 
appropriate factor(s) and make the necessary adjustment to the logarithmic coefficient(s). 
Accordingly, it is understood that versions of these equations which use logarithms of other 
bases are mathematically equivalent to the equations and formulations set forth in this 

15 application, and merely provide alternative representations or descriptions of the algorithms and 
computational methods of this invention. Indeed, those skilled in the mathematical arts will 
appreciate that the equations and formulas set forth throughout this application may be written or 
expressed in a variety of different ways that are mathematically equivalent. Such mathematically 
equivalent expressions merely represent alternative representations or descriptions of the 

20 computational methods that they describe rather than any departure from those methods, 

Values for the coefficients k and b were selected for Equations 6.7 and 6.8, above, 
that optimized the fit of experimentally determined T m values (see, Table I, supra) to those 
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equations. First, the experimentally determined T m values were fit to a form of those equations 
in which the optional second order polynomial term (i.e., the term b x (ln 2 [Na + ] ( i) - ln 2 [Na + ] (0 )) ) 
was omitted (i.e., 6 = 0). A coefficient k was selected that is a linear function of the nucleic acid 
G-C content: 

5 

k(f(G~ C))= m f(G- C)+k 0 (Equation 6.9) 

Constant values for the coefficients m and k 0 were selected that optimized the 
goodness of fit. In a second analysis, the experimentally determined T m values were fit to a form 

10 of Equations 6.7 and 6.8 that included the optional second term (b * 0). The linear form of 

coefficient k set forth in Equation 6.9 was again used, and constant values for the coefficients m, 
ko and b were selected to optimize goodness of fit. 

In each analysis, a reference salt concentration of [Na + ] (0 ) = 1.02 M was used, and 
the reference melting temperature, T m (0) was the oligomer's experimentally determined melting 

15 temperature at that cation concentration. Goodness of fit was evaluated from the reduced "chi- 
square" value (& 2 = #Vv) and from <|A J^j^^ , the average difference between the measured T m 
values and corresponding T m values predicted using the equation. The chi-squared goodness of 
fit test compares a theoretical distribution with the observed data from a sample. (See William H. 
Press et al., Numerical Recipes in C: The Art of Scientific Computing 659-61 (2d Ed. 1992)). 

20 Thus, smaller values for (% 2 and/or <\AT m \> AVE indicate that the equation or model used 

accurately and reliably predicts actual melting temperatures for different salt concentrations. 

Coefficient values for each fit of the experimental data in Equations 6.7 and 6.8 are 

provided in Table II, below, along with each fit's reduced chi-squared and <\&T m \> AVE values. 

-50- 

{M:\3988\100k297usl\00051867.DOC UIIIIUBHBIDIIIIlin } 



TABLE II: Empirical Fits of Experimental T m Values 



Equation 


Coefficient 






Fit Quality 




k = m/{G-C) + k 0 




<\AT m \> AVE 


m 




b 


6.8 


4.29x1 0" 5 


-3.95x1 0' 5 


9.40x1 0" 6 


4.4 


0.5 


6.7 


-4.62 


4.52 


-0.985 


9.9 


0.7 


6.8 


3.85xl0" 5 


-6.18xl0" 5 


-0- 


19.5 


1.1 


6.7 


-3.22 


6.39 


-0- 


21.8 


1.2 



The date from Table I, supra, were also fit to Equations that have been previously 
5 described for estimating salt effects on polynucleotide melting temperatures. Each of those 
formulas is also discussed herein above, in Example 2. 

More specifically, the data in Table I were fit to a form of Equation 6 A, supra 
using a coefficient m = 16.6 (described by Schildkraut et ah, Biopolmers 1965, 3:195-208) and 
also to a form of that equation with the coefficients m = 12.5 (See, SantaLucia et al 9 
10 Biochemistry 1996, 35:3555-3562). In addition, the data were also fit to Equation 6.6, supra. 

As before, each analysis used a reference salt concentration [Na + ]( 0 ) = 1.02 M. 
The reference melting temperature, r w (0) was the oligomer's experimentally determined melting 
temperature. The enthalpy term in Equation 6.6 (i.e., AH°) was calculated for each 
oligonucleotide using the nearest neighbor model and parameters optimized for 1 M Na + 
15 concentration (SantaLucia et al Proc. Natl Acad. Set 1998, 95:1460-1465). 

-51- 

{M:\3988\100k297usl\0005 1867.DOC IMIllMlinEiailMBI } 



Goodness of fit was evaluated from the reduced "chi-square" value (% 2 = x*/v) and 
from <\AT m \> A y E , the average difference between the measured T m values and corresponding T m 
values predicted using the equation. The reduced chi-squared and <\&T m \>AVE values for 
Equations 6.1 and 6.6 are provided in Table III, below. 



TABLE III: EMPIRICAL FITS OF EXPERIMENTAL T m VALUES 
TO PRIOR ART EQUATIONS 

Equation Xl\ <\M m \> AVE 

6^6 443 L7 

6.1(m=16.6) 337.7 5.1 

6.1 (w=12.5) 68.4 2.1 



These results confirm that melting temperatures of polynucleotides may be more accurately and 
reliably estimated by using formulas such as Equations 6.7 and 6.8, above, that depend on 
nucleotide content. 

Thus, four new formulas are provided here which relate the melting temperature of a 
nucleic acid to the salt conditions in which the nucleic acid denaturation is actually performed: 



T m - T m ° + (6.39- 3.22 x f(G- Q)x ln-^prf (Equation 6.10) 

Jo 



= + (3.85 x f(G - O - 6.1 8) x 1 (T 5 x In (Equation 6.11) 

1 m 1 m i" a Jo 
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T = 

m 




(Equation 6.12) 



-0.985x {ln 2 [M + ]-ln 2 [^] (0) } 




(Equation 6.13) 



+9.40x 10" 6 x {ln 2 [^Va + ]-ln 2 [^ + ] (0) } 



5 Any one of these formulas may be used in connection with the present invention, e.g., to predict 
the melting temperature of a particular nucleic acid in certain salt conditions. Indeed, the data 
presented in Table II, and Table III supra, shows that these equations and algorithms may predict 
or estimate the melting temperature of a particular nucleic acid with greater accuracy and 
reliability than existing methods. 



publications, are cited and discussed in the description of this invention. The citation and/or 
discussion of such references is provided merely to clarify the description of the present 
15 invention and is not an admission that any such reference is "prior art" to the invention described 
herein. All references cited and discussed in this specification are incorporated herein by 
reference in their entirety and to the same extent as if each reference was individually 
incorporated by reference. 



10 
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